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Abstract 

We seek for lines of minimal distance to finitely many given points in 
f"*) the plane. The distance between a line and a set of points is defined by 

the L p -norm, 1 < p < oo, of the vector of vertical or orthogonal distances 
from the single points to the line. 

UThe known properties of optimal lines are deduced by elementary con- 
siderations and represented using a uniform language for the different 
choices to define the distance from a line to a set of points. 

1 Introduction 

a 

l — 1 This article deals with an elementary problem. We seek for a line that is as 

close as possible to a finite set of points in the plane. The regression line is a 
well-known solution and it is even easy to compute. However, it is only one out 

£T) of many possible answers, since the optimal line depends on the definition of 

the distance between a line and a set of points. 

<N 

• 1.1 Distance between a line and a set of points 

ON 

There are several useful ways to measure the distance between a line and a finite 
set of points. Usually, this is done in two steps: First the distance dj between 
a single point pj and a line is defined. Then these distances are combined to a 
distance between the line and the point set {p\, . . . ,p m }- 

We consider the vertical and the orthogonal distance between a point and 
jjj a line. The vertical distance between a point (xj,yj) and the graph of a linear 

function y(x) — ax + b is defined by dj — \yj — axj — b\. To emphasize that 
the solution should be a graph of a (linear) function this distance is also called 
algebraic distance. The orthogonal distance between a line g and a point pj is the 
length of the line segment connecting pj and g perpendicular to g. To compute 
this distance the euclidian inner product is used. Therefore, this distance is also 
called euclidian or geometric distance. 

The distance between a point set {p\, . . . ,Pm} and a line is defined by the 
L p -norm of the vector (d\, . . . ,d m ) <G R m , 1 < p < oo. For p = 2 we want 
to minimize the sum of squares d\ + . . . + d 2 m of the distances of the single 
points. This is the classical method of least squares. The L 2 -norm is preferred 
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in statistics. For p = 1 and p = oo we want to minimize the sum d\ + . . . + d m 
respectively the maximum max{c?i, . . . , d m } of the distances of the single points. 
These quantities occur in optimization problems. Then the L p -norm is a natural 
generalization. Furthermore, the investigation of the function d\ + . . . + d v m for 
all 1 < p connects the extreme norms (p = 1, p = oo) to the case p = 2. 

1.2 Goals 

The results of this article are known, partially for a long time, (e.g. Q], [3J, [3], 
0) 0i PDi 0) or can be rather easily derived for some distances. But the facts 
are spread and are often formulated according to the needs of the applications. 
In this article the results are presented in a uniform language accessible to a 
broad audience. The results are deduced by elementary considerations that 
require only basic linear algebra and calculus. Therefore, the appendix contains 
the necessary facts on convex sets and convex functions. 

Due to the homogeneous representation one can easily observe how the prop- 
erties of the objective function /, the suitable methods and the set of optimal 
lines change using different definitions of the distance. Here are two examples: 
The algebraic distance leads to convex or even strictly convex functions. Using 
the geometric distance we loose the convexity in one variable but we gain the 
compactness of the domain of definition of /. Differential calculus provides ex- 
plicit formulas of optimal lines for p = 2. For p = 1 and p — oo the function / 
is not even differentiable at all points but it is piecewise linear. So the global 
minimum can be determined comparing finitely many values of /. 

Despite the differences of the distances investigated here there is one basic 
idea behind all the solutions. Properties of optimal lines are obtained observing 
the behavior of / while translating and rotating a line. If / is differentiable, 
then partial differentiation yields a system of equations for the critical points 
of /, else this approach gives at least information about the location of optimal 
lines. 

Moreover, dealing with this elementary optimization problem, that can be 
satisfactory solved by simple arguments in many cases, motivates further ques- 
tions in different fields of mathematics. For example, what are the effects of 
small changes of the given point set or the parameter p. If explicit formulas of 
the optimal lines are not available, then one needs fast algorithms to manage 
the input data (p — 1 and p = oo) or to numerically approximate the solution 
(p 1,2, oo). It is even more challenging to fit other objects such as circles 
or ellipses to point sets [3]. In these cases the basic questions of existence and 
uniqueness of objects with minimal distance arc already more difficult to answer. 

1.3 Symmetries 

The algebraic and the geometric distance and the associated optimal lines, be- 
have differently regarding coordinate changes. 

The geometric distances are by definition invariant under isometries, these 
are translations, rotations and reflections. The algebraic distances are invariant 
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under translations and reflections in the coordinate axes or a point. 

If the distance is invariant under an (affine) transformation A : M. 2 — > R 2 , 
then a line g has minimal distance to the point set {pi, . . . ,p m } if and only if 
the line A(g) has minimal distance to the point set {A(pi),...,A(p m )}, i.e., 
the set of optimal lines is equivariant with respect to transformations which do 
not change the distance. In addition, the set of algebraically optimal lines is 
equivariant with respect to scalings of the coordinates, i.e., (x,y) h-> (Xx,ny) 
with 0^A,^eK, and the set of geometrically optimal lines is equivariant with 
respect to dilations, i.e. (x, y) ^ (Xx, Xy) with O^Ael. 

It is useful to consider the symmetries of the point set in order to find the 
appropriate definition of a distance for a specific application, to restrict the 
domain of definition of /, or to simplify determination of the optimal line for 
strictly convex functions /, since the optimal line is unique in that case. 

1.4 Point sets with multiplicities 

In this article we work with the standard assumption that the given points 
Pi, . . . ,p m are pairwise distinct, i.e., pj ^ pk for all j ^ k, and m > 2. Inves- 
tigating the algebraic distances we additionally assume that the x-coordinates 
Xi,...x m of the points pj — (xj,yj) T are pairwise distinct. It is easier to 
formulate the results in this context. 

For each distance considered here we discuss how the main statement changes 
if the standard assumption is omitted. Therefore, our set up covers also dis- 
tances which come from weighted norm on (di, . . . , d m ), e.g., Y^JLi with 

nj e N. 

2 Method of least squares - L 2 -norm 

In this section we minimize the i 2 -norm of (d\, . . . , d m ). We consider the func- 
tion /(<?) = Y^ijLi dj- The function / is a quadratic polynomial in the param- 
eters of a line for the algebraic and the geometric distance. Using differential 
calculus we derive explicit formulas of the optimal lines. 

2.1 Linear regression - minimal algebraic L 2 -distance 

Given points pj = (xj,yj) T e M 2 , j = 1, . . . , m, we determine a linear function 
y(x) = ax + b, a, b G K, with minimal algebraic L 2 -distance to the point set 
{pi, . . . ,p m }, i.e., the minimum of the function / : M 2 — > E defined by 

n m 

/(a, b) ■= J^ivj - y{ x 3 )f = ~ ax i - fe ) 2 - 

This approach excludes lines which are parallel to the j/-axis (x = c). Therefore, 
we assume Xj ^ Xk for all j ^ k. In particular, the points pi ■ ■ . ,p m are pairwise 
distinct. 
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2.1.1 Critical points 

The function / is differentiable and 

(tci m 
- 2 ^2(Vj - ax j ~ b ) x h - 2 ^2(Vj - ax j - b ) 
J = l 3=1 

in m m in m 

X! • "X- r < ■ X-^ ■ "X! • 

j'=i 3=1 3=1 j'=i 3=1 

The critical points of / are given by the solution of the following system of 
equations: 

tci rn tci rn m 

!, X r / ■ ''X r ' X a X^ +5m = X^ w 

3=1 j=i 3=1 3=1 3=1 

2.1.2 Convexity 

The function / is convex, because the summands (axj + b — yj) 2 are convex 
functions. These summands are strictly convex if and only if Xj ^ 0. Since the 
inequality Xj ^ holds for at least one index j, the function / is strictly convex. 
This can be checked directly using the Hesse-matrix Hf. 



Hf = [Wl j . U T J )> 



x 



Em 
.7 = 1 



Xi m 



j=l 3 

It follows from the Schwarz inequality that 

2 / \ 2 






where equality holds if and only if x = {x\, . . . , x m ) is a multiple of the vector 
(1, . . . , 1). Since xi, . . . , x m are pairwise distinct, we have Ylj=i x j > an d 
det(if/) > 0. The matrix Hf is strictly positive definite at all points (a, b) e R 2 . 

There exists exactly one critical point, since the function / is strictly convex. 
The function / attains its local and global minimum at this unique critical point. 

2.1.3 Globale minimum 

The solution of the system of linear equations ([I]) gives the global minimum of 
the function / : R 2 — » R. The second equation means that the optimal line 
contains the center of mass of the points p\ , . . . , p m 

p := — Vpj ; = (x, y) T with S = — V x i( y = — V y r (2) 

3=1 3=1 3=1 
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Thus b — y — ax. 

Moving the origin to the center of mass we obtain the new coordinates 
x := x — x, y := y — y, Xj := Xj — x and tjj :— yj — y. System ([!]) transforms into 
the equations a 5Zj=i %j = Sj=i ^jVj an d ^ = for the optimal line y = ax + b, 
since Y!j=i %i = J2T=i Vj = °- Set 

mm m in 

s xx -=J2 i2 j = ^2( x 3 ~ 5 ) 2 ' s *y := ^fyvj = ^2( x j - x){yj - v)- ( 3 ) 

Theorem 1. Let pj — (xj, yj) T . j — 1, • ■ ■ , m, such that Xj ^ Xk for all j ^ k. 
There exists a unique linear {unction y{x) — ax + b with minimal algebraic L 2 - 
distance to the set {pi, . . . ,p m }- This function has slope S xy /S xx and its graph 
contains the center of mass of the set {pi, . . . ,p m }, i.e., 

y(x) = ^(x-x) + y. (4) 
& xx 

Proof. The translation (x,y) h-> (x,y) does not change the slope. □ 

Corollary 1. If the set {pi, . . . ,p m } is invariant under the reflection in the 
line x = x, then S xy = 0. 

Proof. Since the optimal linear function is unique, it is invariant under the 
reflection in the line x — x. The graph of the optimal function is not the line 

x = x. Thus, it is perpendicular to x — x. □ 

Remark 1. Theorem [I] remains true if the condition Xj ^ x^ for all j ^ k is 
weakened to there existence of indices j ^ k satisfying Xj =^ Xj.. However, if 

xi = . . . = x m = x, then the graph of any linear function y(x) = a(x — x) + y 
with a £ R is an algebraically L 2 -optimal line. 

2.1.4 Generalizations 

Similarly, there exists a unique linear function y(x±, . . . , x n ) — Y17=i aiXi + a o 
having minimal algebraic L 2 -distance to a given finite set in K™ +1 . Whenever 
a finite set in the plane has to be approximated by functions which are linear 
in the parameters to be optimized, the method that worked in this section can 
be applied, because the critical points of the convex objective function are the 
solution of a linear system of equations. 

2.2 Minimal geometric L 2 -distance 

Given pairwise distinct points pj G R 2 , j = 1, . . . ,m, we determine lines g C 
M 2 with minimal geometric L 2 -distance to the point set {p%, . . . ,p m }, i-e., the 
minimum of the function / : 1 x S 1 4 1 defined by 

n 

f(c,n) := ^( c ~ (Pj' n )f- 

3=1 
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Here, a line g C R 2 is described by one of its normal vectors n € S 1 and 
a point qo 6 g. The points of the line g are the solutions of the equation 
(q, n) = (qo,n) =:c£l, i.e., g = {q e R 2 : (g, ra) = c}. The geometric distance 
between pj and 5 is given by |c — (jpj, n)\ (see subsection 2.2.1 ). 



2.2.1 Geometric distance between a point and a line 

For qj € g = {q € R 2 : (g, n) = c} the equation d 3 = min gSg ||g— = ||gj — £>j|| 
holds if and only if qj ~Pj -L g. In the plane, the vector pj — g^ is perpendicular 
to the line g if and only if the vector pj — qj is a multiple of the normal vector 
n of the line, i.e., qj — pj = Xn. In that case, qj — Pj + Xn and d(pj,g) = |A|. 
Now, 

c = = (Pj + An,n> = (pj,n) + X\\n\\ 2 = (pj,n) + A, 

since qj = pj + Xn and qj G g. Thus, A = c — (pj,n) = (go — Pj, n). 



2.2.2 Minimum for fixed normal vector 

We fix a normal vector n 6 S 1 and define f(c) := f(c, n). Now 

m / m \ 

/'(c) = 2j2(c-{pj,n)) = 2mc-2 Q] Pj ,n , /"(c) = 2m > 0. 
i=i \j=i / 

Thus, the function / : R -> 1, c 4 /(c), is strictly convex. Every local 
extremum is a global minimum. The equation /'(c) = holds if and only if 

/ 1 m \ 1 m 

c = ( — y^Pj, n ) = (p, n) where p = — Vp r (5) 

\ j=i / i=i 

For any fixed normal vector n the line which contains the center of mass p gives 
the minimum of / : R — > M, c 1— > /(c). 



2.2.3 Elimination of the variable c 

In order to find the minimum of the function / : R 2 — > R, (c, n) 1— » /(c, n), it 
is sufficient to investigate lines that contain the center of mass p. We consider 
/ : S 1 -> R^° defined by 

m mm 

n>-> f((p,n),n) = ^(p-pj^n) 2 = ^(pj,n) 2 = ^2(pjn) 2 where ^ =p-pj. 
3=1 3=1 3=1 

The change of notation (frompj topj = (xj, Vj) T ) corresponds to the translation 
of the origin to the center of mass p. The function f(n) is even, i.e., f(n) = 
f(-n) for all n € S 1 . 



6 



2.2.4 Optimal normal vector 

We use the parametrization n(t) — (cost, sin t) T and consider the function 
h(t) := f(n(tj). Since pjn = n T pj and 



n' (t )=(- o ^ t )=j n(t )withj=(; - 1 



wc obtain 



dh d 

— = Zj^pfnpf-^ ^2Y / n T p J pjJn = 2n T I J^PjPj ] Jn = 2n T SJn 



with 

m s ^ S \ m m m 

S = y, PjPj — [ q q V ) ; &xx = / , % j 1 S xy — ^ Xjijj , S yy = ^ Vj ■ 
3 = 1 V •' W i=l 3 I i=l 

(6) 

It is easy to check that the matrix S is symmetric: 

I rn \ rn m m 

s T - Efctf = E fe T ) T - E fe T ) T pJ = E^J = s 

\j = l J 3 = 1 3 = 1 3 = 1 

The fact Jn _L n yields tf(()=0^iil 5"Jn S'Jn = AJn for a A e R. This 
means that Jn is an eigenvector of the matrix S. The symmetric matrix S is 
diagonalisible. The eigenspaces of S are perpendicular to each other. Thus, n 
is an eigenvector of S if and only if Jn is an eigenvector of S. 

• If the matrix S has two different eigenvalues Ai ^ A 2 , then the associated 
normalized eigenvectors ±m, ±n 2 are the critical points of / : n i-> /(n). 

• If the matrix 5 has a two dimensional cigenspace, then = 0. Thus 
all lines through the center of mass are optimal. 

Using the identity J 2 = — Id we derive 

= = — (2n T SJn) = 2 (SJn) — + n T SJ— 

dt 2 dt y ' y 1 dt dt ) 

= 2 ({Jn) T S T Jn + n T SJJn) = 2 ({JnfSJn - n T Sn) . 

Now, 

h"(±m) = 2(A 2 - Ai) and h" (±n 2 ) = 2(X 1 - A 2 ). 

Therefore, the normalized eigenvectors of the smallest and the largest eigen- 
value correspond to the local minima respectively maxima of f(n). These local 
extrema are global, since f(n) is symmetric. 

Lemma 1. Let E_ be the eigenspace of the smallest eigenvalue of S . 
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• E_ = R 2 if and only if S xy — und S xx = S yy . 

• (1, 0) T G E_ if and only if S xy — und S xx < S yy . 

• I/(1,0) T £ E_, then E_ = R(a,-1) T with 

a=- 2Sxv 



XX Syy + ^ (S XX Syy) 2 + 

Proof. The vector (1,0) T is an eigenvector of 5 if and only if S is a diagonal 
matrix, i.e., S xy = 0. Then (1, 0) T and (0, 1) T are eigenvectors of the eigenvalues 
S xx respectively S yy . 

Let us calculate the eigenvalues of S: 

= det(S - A Id) = (S xx - A) {Syy - A) - S 2 xy 

= A — A (S XX + Syy) + S XX Syy — S xy 
A = \{S XX + Syy) ± li? With D = y/(S xx -Syy)*+4S* y 

If (1,0) T then dim_E_ = 1 and S*^ ^ or > The general 

solution of the linear equation (S — XId)v = with A = (S xx + S yy — D)/2 is 
v = t(a, -1) T with t e R and a = 2S xy /(S xx - S yy + D). □ 

Theorem 2. Let pj € M 2 , j = 1, . . . , m, &e pairwise distinct points. 
A line g C R 2 has minimal geometric L 2 -distance to the set {pi, ■ ■ ■ ,p m } if 
and only if p G g and the eigenspace of the smallest eigenvalue of S contains a 
normal vector of g. 

• If S xy = and S xx = S yy , then any line through p is optimal. 

• If S xy — and S xx < S yv , then there exists a unique optimal line: x = x 

• If S xy ^ or S xx > S y y, then there exists a unique optimal line: 

9 <? 

>Jxx ^yy ' u 

Proof. An optimal line with normal vector (a, — 1) T can be parametrized by 
g = {p + t(l, a) T : t G R}. The slope of this line is a. □ 

Corollary 2. If the point set {pi, . . . ,p m } is invariant under reflection in a 
line g 9 p, then this line g or the line perpendicular to g containing p is a line 
with minimal geometric L 2 -distance to the set {pi, . . . ,p m }. 

Proof. Let I =^ g be an optimal line that contains p and is not perpendicular 
to g. Since / is not invariant under the reflection in g, there exist at least two 
optimal lines. Thus, all lines containing p are optimal. □ 
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Corollary 3. If the point set {pi, . . . ,p m } is invariant under a rotation around 
p through an angle <fi g" Ztt, then S xy — and S xx = S yy . 

Proof. Since <fi 7^ kir for all k € Z , no line is invariant under the rotation. Thus, 
all lines containing p are optimal. □ 

Remark 2. Theorem [2] and its corollaries remain true if the points Pi,... ,p m 
are not pairwise distinct. 



2.2.5 Generalizations 

A natural generalization of the subject in this section is an affine subspace of 
fixed dimension k < n with minimal geometric L 2 -distance to a given finite 
set in E™. The corresponding optimal lines and planes in K 3 are discussed 
in [5J. For k = n — 1 this problem leads to overdetermined systems of linear 
equations that can be treated with total least squares (TLS) techniques. In this 
section the essential information about the point set is stored in the matrix S. 
The decomposition of a similar matrix S reappears in a method of multivariate 
statistics called principal component analysis (PCA). 

2.3 Example an comparison algebraic versus geometric 

Any L 2 -optimal line contains the center of mass of the set {pi, . . . ,p m }- The 
algebraic problem always possesses a unique solution. The geometric problem 
admits a unique solution that is a graph of a linear function if and only if Sxy 7^ 

Or S XX ^> Syy . 

Theorem 3. The algebraic and the geometric L? -optimal lines coincide if and 
only if this line contains all points Pj or S X y — and S xx ^ Syy. 

Proof. If S xy = and Sxx > Syy> then the algebraic and the geometric L 2 - 
optimal lines coincide if and only if 

S x y 2S x y 



^xx ^xx ^yy ~ 



This equation holds if and only if S xy = or 



S X x ^yy ^ 2S XX 

D S XX -\~ Syy 

(S xx Syy) ~\- ^S xy {S xx ~\- Syy^j 

^xx^yy ^xy 

It follows from the Schwarz inequality that 

m m m 

SxxS yy — X 2 jjj > %j&j) 2 = Sl y , 

3=1 3=1 J = l 

where equality holds if and only if Xxj = i/j for a A G R and all j. □ 
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Figure 1: vertical and orthogonal L 2 -optimal lines 



We consider the four points p\ = (0, 0) T , pi — (1, 1) T , pz = (2, 2) T and 
Pi = (3,3/2) T (see Figure [B. Their center of mass is p = (3/2,9/8) T . Now, 
Px = (-3/2,-9/8) T , p 2 = (-l/2,-l/8) T , p 3 = (l/2,7/8) T , p, = (3/2,3/8) T , 
S X x = 5, S yy = 35/16 und S xy = 11/4. The algebraic L 2 -optimal linear function 
is 

/ 3\ 9 11 
y[x) = a \x — - I + - with a = — = 0.55. 



2/8 20 
The geometric L 2 -optimal line is the graph of the linear function 

, , / 3\ 9 88 

y(x) = a \ x — - + - with a = . = 0.61. 

yy ' \ 2J 8 45 + V45 2 + 88 2 

The algebraic and the geometric L 2 -optimal lines intersect at p. 



3 Absolute distance - L -Norm 

In this section we minimize the sum of the distances of the single points pj . We 
consider the function f(g) — Ylj=i dj- This function is differentiable at g if and 
only if dj > for all j, i.e., g contains none of the points Pj. We prove that 
there exists a global minimum of / for the algebraic and the geometric distance. 
The set of all L 1 -optimal lines is described. Explicit formulas of the elements 
of M in terms of p\, . . . ,p m are not available, yet the set M can be completely 
characterized by comparing the values of / at lines containing at least two of 
the points pj. Minimizing the T^-norm can be carried out in < 0(m 2 ) steps. 

3.1 Minimal algebraic L 1 -distance 

Given points pj = (Xj, yj) T € K 2 , j — 1, . . . , m, we determine all linear functions 
y{x) — ax + b, a,b € R, with minimal algebraic L 1 -distance to the point set 
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Figure 2: algebraic decomposition of Figure 3: definition of ct a (j) f° r lines 
the index set, definition of /9 CT (j) through pk 



{pi, . . . ,p m }, i-e., the minimum of the function / : K 2 — > K defined by 

m 

f(a,b) = J2\yj-(<™j+b)\- 



As in subsection 2.1 we additionally assume that Xj ^ Xk for all j ^ fc. 

The function / is continuous, piecewise linear, convex and bounded from 
below. Thus, / admits a global minimum. The set of algebraically L -optimal 
lines M is convex. Since / is not strictly convex, the set M could be unbounded. 
We show that M is the convex hull of finitely many points. 

3.1.1 Decomposition of the index set 

For every linear function y(x) = ax + b we define J+ := {j : yj > y(xj)}, 
Jo := {j : yj = y(xj)} und J_ := {j : yj< y(xj)}. This decomposition depends 
on the parameters a and b (see Figure pi). The sets J+, Jo and J_ are pairwise 
disjoint, J+ U Jo U J_ = {1, . . . , m} and 

f{a, b ) = E ( y J ~ ax i ~ b ">~ E ( y i ~ ax i ~ b "> 

= 6(i j_i - |j+D + ( E % - E % I - a ( E ^ - E ^ 



3.1.2 Translation 

For any neRwe consider the values of / at lines with fixed slope a and variable 
b. We want to determine the optimal line with slope a, i.e., min{/(a, b) : b 6 K.}. 

Note that f(a, 0) = X)j=i 1%' — axj|. Set /3j := yj — axj. Let cr be a 
permutation satisfying P a (i) < ■ ■ • < /?a-(m) ( see Figure SI). 
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Lemma 2. For every a € R i/ie function f(a, •) : R — > R, b i-> /(a, 6) admits a 
global minimum. If f (a, bo) = min{/(a, 6) : 6 e R} ; i/iera /3cr(i) < 0o < Palm)- 

Proof. If o > /? CT ( m ), then J + = J = 0. If 6 < then J_ = J = 0- 

Consequently, 



/(a, 6) = mb - Vj + a x i for b > ^( m ) 



and 



/(a, 6) = -m& + % - a Xj for 6 < /3 ct(1) . 

The function /(&) is strictly decreasing for 6 < and strictly increasing for 

b > Pa(m)- Continuity of / yields 

inf f(a,b) = inf f(a,b) = min f(a,b). 

□ 

Lemma 3. For every a € R i/ie function f(a, •) admits its global minimum at 
b if and only if |J_| + |J | - | J+| > und |J+| + |J | - | J_| > 0. 

For every set £/ie function f(a, •) admits its strict global minimum at b if 
and only if\J-\ + \Jo\ — \ J+\ > und \ J + \ + \ J \ — |J_| > 0. 

Proof. Let J + , Jq, J_ be the decomposition of the index set of the function 
y(x) = ax + b. If max{/3j — 6 : j e J_} < e < mm{(3j — b : j e J + }, then 
the decomposition of the index set of the function ax + b + e is equal to the 
decomposition of ax + b. Now 

f(a, b + s) = (Vj - ax i - b-e)- ^ {yj - axj - b - e) + ^ |e| 
j£J+ j£J- ie./o 



= f(a,b)+e- 



|J_| + |J |-|J+| ife>0 
|J_|-|J |-|J+| ife<0 



□ 

Corollary 4. For every a e R the function f(a, •) admits its global minimum 
at b if and only if \\J+\ — \ J-\\ < \ Jo\- 

Furthermore, min{/ (a, 6) : b G R} = /(a, b ) if and only if (3 k < b < Pi with 



k 



m + 1 



m + 1 



Proof. The inequalities | J_ | + 1 J 1 - | J+ 1 > and | J+ 1 + | J 1 - | J_ | > arc both 
satisfied if and only if | Jo | > 1 1 J+ 1 — | J_ 1 1 . 
If Pk < b < Pi, then 
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• |J_| + \ J \ > k, \ J + \ <m-k< \ J-\ + |J |, since 2k > m. 

• |J+| + |J | > to- l + l, \ J-\ < I- 1 < |J+| + |J |, since 2(1 - 1) < m. 

If 6 < /3 fc , then |J_| + |J | < fc - 1 and \ J+\ > to - (fc - 1) > | J_| + | J |, 
since 2(fc — 1) < m. 

If 6 > ft, then |J+| + |J | < m-l and |J_| > Z > |J+| + |J |, since 21 > m. □ 

Corollary 5. Among all optimal lines with fixed slope a €E R t/iere exists at 
least one that contains one of the points pj . 

Proof. If b = i3 k or b = f3 u then J ^ 0. □ 
3.1.3 Rotation 

For every point pk we consider the values of / at lines containing p k . These 
lines are obtained by rotating one of them around p k . The condition y(xk) = Uk 
implies b = y k —ax k . For any fixed index k we investigate the function / : R — > R 
dchned by 

m 

f(a) := f(a,y k - ax k ) = \yj - axj - (y k - ax k )\ 

m 
3=1 

with := y 3 - y k and Xj := Xj - x fc . 

The coordinate change (xj,yj) — ¥ (xj,yj) corresponds to the translation of 
the origin to the center of the rotation. The decomposition of the index set is now 
given by J_ = {j : yj < ax 3 }, J = {j : y 3 = axj} 3 k and J + = {j : yj > ax }. 
It follows 

/(a) = Vi - E Vj ~ a [J2 x 3 ~ E 5 J ■ ■ 

Set ctj :— jjj/xj for j ^ k. Let r be a permutation satisfying r(m) = k and 
a T (i) < • • ■ < a r(m _i) (see Figure[3]). 

Lemma 4. T/ie function f : a i— > /(a) admits a global minimum. 
If f(ao) = min{/(a) : a e R}, £/ien a T(1) < a < a r(m _i). 

Proof. If a > a T ( ro _ 1 ), then J = {fc}, J + = {j : Xj < 0}, J_ = {j : > 0} 

and f'(a) = *£jeJ_ x j ~ J2jeJ+ x 3 > °- 

If a < a r (i), then J = {k}, J+ = {j ■ Xj > 0}, J_ = {j : ij < 0} and 

/'(«) = EjeJ_ % ~ T,jeJ+ x 3 > °- 
Continuity of / yields 

in f/( a ) = ^ f( a )= ™} n /(«)■ 

□ 
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Corollary 6. The function f : R 2 — > K defined by f(a, b) = Y^j=i \Uj — ax j ~ &l 
admits a global minimum. This global minimum is attained at a bounded set. 

Proof. If /(a, b) is a global minimum of /, then ao < a < a\ and bo < b < b\ 
where 

. Vj - Vk Vj-Vk , . , 

ao = mm — , a\ = max — , bo = mm?/,- — aiXj, b\ — maxy,- — aiXj. 

]^k Xj - X k j^k Xj - X k 1,3 i,j 

□ 

Lemma 5. For every index k holds min{/(a) : a G R} = min{/(ay) : j ^ fc}. 

Proof. If Jo = {k}, then / is differentiable at the line corresponding to that 
decomposition. Note that Jo = {k} if and only if a ^ ctj for all j =^ k. If 
/'(a) = for ael with a a r k -\ < a < icr(fc+i), then / is constant on the interval 
[<V(fc)> a <r(fe+i)]- Q 



3.1.4 The set of optimal lines 

Corollary [5] and Lemma [5] imply the existence of a line containing at least 
two pj ^ p k among all lines with minimal algebraic L^distance to the set 
{pi, . . . ,p m }. There are < m(m — 1)/2 lines with this property. It is sufficient to 
check the lines that additionally satisfy 1 1 J+ 1 — | J_ 1 1 < | Jo | to find the minimum 
of/. 

The line which contains the two points pj and p k with j ^ k is called gj k . 
The line g, jk is the graph of the linear function y(x) = aj k x + bj k with 

a jk = * Vk and b jk = Vk ^ V > Xk . (8) 

Xj X k Xj x k 

The line gj k corresponds to a point in the domain of definition {(a, b) : a,b £ R} 
of the function /. This point (a, b) is the unique solution of the two equations 
yj = axj+b and y k = ax k +b, since Xj ^ x k . We denote this point of intersection 
by gjk too, i.e., gj k — (aj k , bj k ). Let E be the set of optimal lines containing at 
least two of the points pj . More precisely, 

E := {g k :j>k and f(a jk ,b jk ) = min{/(o, 6) : a, b £ R 2 }}. (9) 

Theorem 4. Let pj = (xj, yj) T , j — 1, • ■ ■ , m, such that Xj ^ x k for all j ^ k. 
A linear function y{x) = ax + b has minimal algebraic L 1 -distance to the set 
{pi, . . . ,p m } if and only if (a,b) is contained in the convex hull of E. 

Proof. If /(ao, &o) is the global minimum of / and y k — aox k + bo for exactly one 
index 1 < k < m, then Lemma [5] implies the existence of indices 1 < j\,j2 < m 
satisfying j t ,j 2 ^ k and f{g nk ) = f(gj 2 k) = /(a ,& )- Consequently the point 
(ao,&o) li es 011 the line y k = ax k + b between the points 9j 1 k,gj 2 k £ E. 

If /(a ,6o) is the global minimum of / and yj ^ ao^j + bo for all j, then 
Corollary [4] implies the existence if indices k, I satisfying /(ao, bo) = /(ao, b) for 
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all b k ■= Vk - a x k < b < yi - a yi =: h. Since f(a ,b k ) = f{a ,bi) is the 
global minimum of / and the lines corresponding to (oo, bk) and (ao, bi) contain 
the point pk respectively pi, the points (ao, bk) and (ao, h) are contained in the 
convex hull of E. Now (ao,i>o) is a point of the line segment from (a ,bk) to 
(ao, h). Therefore, (ao,&o) is in the convex hull of E. □ 

Remark 3. With a small change in the definition of E Theorem [4] remains true, 
if the condition Xj ^= Xk for all j ^ k is weakened to existence of indices j =/= k 
satisfying Xj ^ Xk- This weaker assumption is sufficient to bound interesting 
slopes using Lemma [4j because ctj ^ ±00 for at least on index j. To adjust the 
definition of E we then regard only lines gjk with Xj ^ Xk- 

However, if x x = . . . = x m = 5, then any graph of a linear function contains 
at most one of the points pj. This would mean that E = 0. But note that 
the algebraic distances between the points pj and the linear function y(x) = 
a(x — x) + b are independent of the variable a. We substitute the set E by 
the set B := {y k : f(0,y k ) = mm{/(0 )% ) : j = 1,...,™}} C E. Now, 
y{x) = a(x — x) + b is an algebraically i^-optimal linear function if and only if 
b is contained in the convex hull of B. 



3.1.5 Examples 

Three points: We show the existence of a unique linear function with minimal 
algebraic I^-distance to three given pairwise distinct points. Let x\ < X2 < 2:3 
and 2/1,2/2)2/3 £ K. There exists a line containing all three points (xj,yj) T if 
and only if (y 2 - yi)/(x 2 - x x ) = (y 3 - yi)/(x 3 - x x ). 
If (2/2 ~ yi)/{x2 - xi) ^ (2/3 - yi)/(x3 ~ xi), then 
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because x 3 — xi > X2 — xi and x 3 — xi > x 3 — X2- There is a unique line with 
minimal algebraic L 1 -distance to {pi,P2,P3}- It is the line through pi and p 3 . 
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Family of optimal lines for m — 4: We consider the points p\ = (0,0) T , 
p 2 = (1,1) T , p 3 = (2,2) T and p 4 = (3,3/2) T (see Figure [J}. Note that g 2 i = 
.931 = 532, because the three points pi, p 2 and p 3 are collinear. Now /(321) = 
3/2, /(ff43) = 9/2, /( 542 ) = 3/2, f(g 4 i) = 3/2. Thus, the set M of optimal lines 
is the convex hull of 521 = .931 = 332, 341 and 342 (see Figure [5]). 

Invar iance under reflection for m — 5: Let us consider the five points 
p x = (-2,1) T , p 2 = (-1,-lf, P3 = (0,0) T , p 4 = (l,-l) r and p 5 = (2,l) T 
(see Figure [6]) . This set is symmetric with respect to the reflection in the y- 
axis. Hence, the set M is invariant under this reflection. The lines 351, 321, 
342 and 354 appear as dotted lines in Figure [6j because they do not satisfy the 
condition ||J + | - |J_|| < \J \. Now f(g 31 ) = f(g 53 ) = 4, f(g 32 ) = f(g i3 ) = 6 
and /(.g 41 ) = f(g 52 ) = 13/3 > 4. 

The set M of optimal lines is the line segment from g§ 3 to 331 in Figure [7j 
The elements of M correspond to the functions y(x) — ax with — 1/2 < a < 1/2. 



1G 



J. 



X 

06 



J. 



Figure 8: geometric decomposition of Figure 9: definition of p+ and p_ for 
the index set, definition of J a (j) lines through pk 

3.2 Minimal geometric ./^-distance 

Given pairwise distinct points pj £ R 2 , j = l,...,m, we determine lines de- 
scribed as g = {q £ R 2 : c = (q,n)}, c € R, n e S 1 with minimal geometric 
Z^-distance to the point set {p\, . . . ,p m }, i.e., the minimum of the function 
/ : R x S 1 -)• R defined by 

m 

f(c,n) =^|c- (Pj,n)|. 
j'=l 

Similar to the definitions in subsection |3 . 1 . 1 1 we decompose the index set. For 
every line {q £ R 2 : (<?, n) = c} set J + := {j : (j>j, n) > c}, J := {j : (p,, n) = c} 
and J_ := {j : (pj,n) < c} (see Figure [8]). Now 

/(c,n) = ^ (c- (pj,n)) + ^ - c) 

3.2.1 Translation 

For any n £ S 1 we restrict / to lines with fixed normal vector n. We want to 
determine the optimal line with normal vector n, i.e., min{/(c, n) : c £ R}. Set 
7j := (pj,n) and let a be a permutation satisfying 7 CT (i) < •■• < 7er(m) (see 
Figure [|]). 

Lemma 6. For every n £ S 1 the function f : R — > R, c > /(c, n), admits a 
global minimum. 7/min{/(c, n) : c £ R} — f(co,n), then J a (i) < c o < la(m)- 

Proof. The function / :ch> f(c,n) is continuous, piecewise linear and convex. 
If Jo = 0, then / is diffcrcntiable and /'(c) = | J_| — \J+\- 
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If c > 7o-( m ), then Jo = J+ = and /'(c) = m > 0. If c < r y a {x\i then 
Jo = J- = and /'(c) = — m < 0. Continuity of / yields 



inf /( c ) = inf {/(c) : 7<x(i) < c < 7<x(m)} = mm{/(c) : 7 . (1) < c < 7 <T(m) }. 



□ 



Lemma 7. For every n Q S 1 the value f(c, n) is a local minimum of the function 
f(-,n) if and only if |J_| + |J | - |J+| > and \ J + \ + |J | - \ J-\ > 0. A local 
minimum is strict if and only if | J_ | + 1 Jo | — | J+ \ > and | J + 1 + | Jo | — | J- 1 > . 

Proof. Let J + , Jq, J_ be the decomposition of the index set for the line (q, n) = 



If max{7j 



j G J_} < e < min{7j — c : j G J+}, then the decomposition 



of the index set for the line (q, n) = c + e is equal to that for (q, n) — c and 



f(c + e,n) = (\J-\-\J+\)(c + e)- ^(p„n)+ 5^(p is i 



•EN 



/(C, «)+£• 



|Jo|-|J+| ife>0 
|J Q |-|J+| ife<0 



□ 



Corollary 7. For every n G S" 1 £/ie -uaZue /(c, n) is a global minimum of the 
function f(-,n) if and only i/||J + | — |J_|| < |Jo|- 

Furthermore, min{/(c) : c G R} = /(cq) «/ and orji?/ £/7fc < cq < 7; wii/i 



fc = 



1 



1 



Proof. Replace f3 by 7 in the proof of Corollary [4] 



□ 



Corollary 8. Among all optimal lines with fixed normal vector n there exists 
at least one containing one of the points pj . 



Proof. If 6 = 7^ and 6 = 7;, then Jo ^ 

Corollary 9. The function / : R X S 1 - 
a global minimum. 



□ 



l-° defined by (c,n) 1— > f(c,n) admits 



Proof. Since the function / is continuous and the set S 1 is compact, we obtain 
inf{/(c, n) : c G R, n G S* 1 } = mm{f((pj, n), n) : n G 5 , j = 1, . . . , m}. □ 



3.2.2 Rotation 

For every point pk we restrict / to lines containing p^. The normal vector of the 
lines is variable, but the condition pk G {q G R 2 : c = (q, ri}} implies c = (pk, n). 
We investigate the function / : S 1 — >• R given by 

m m 

/W : = f((Pk,n),n) = \{Pj,n) ~ (p*,n)| = ^ |(pj -Pfe,n)| = ^ |(Pj,n)|, 
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where pj := pj — pk 7^ for all j 7^ k. 

As before, the change of coordinates pj i-> pj corresponds to the translation 
of the origin to p k . For any normal vector n £ S 1 the decomposition of the 
index set with respect to the new coordinates is given by J + = {(p~j,n) > 0}, 

= {(Pj, n ) = 0} 3 k and J_ = {(p~j,n) < 0} (see Figure [9}. Now 

f(n) = X (& ,n ) ~ X (pj' n ) = ^> n ) where p := X] & ~ X 

Lemma 8. If (pj,no) 7^ for all j 7^ k, then min{/(n) : n £ S 1 } < /(no). 

Proof. Note that p fe = 0. Hence, \ J Q \ > 1. If J+ U J_ 7^ 0, i.e., (p 3 -,n) ^ for 
at least one index j 7^ k, then f(n) — (p, n) > 0. Thus, p 7^ 0. 

Using the parametrization n(t) = (cosi,sini) T of S 1 as in subsection 



2.2.4 



we define the function h(t) := f(n(t)). If (Pj,n) 7^ for all j 7^ k, i.e., J = {fc}, 
then / is differentiable at the corresponding point. Now 

^ = p T Jn = (p, Jn), h"(t) = p T JJn{t) = -p T n(t) = (-p, n{t)). 

If f(n(t)) is a local extremum of / and Jo = {k}, then there exists A ^ 
such that p — An, since h'(t) — an Jn _L n. A local extremum f(n(t))) of 
/ satisfying Jq — {k} is a local minimum if and only if A < 0, since h"(t) = 
— X(n, n) = —A. Set p + := J2je.j Pj an< ^ P- : = Pi- Note that p + — p- = 

p, (p + ,n) > and (p_,n) < 0. This implies (p + ,An) < 0, (p_,An) > 0. Thus, 
(p + — p_,An) < contradicting (p+ — p_, An) = (p,p) = ||p|| 2 > 0. □ 

3.2.3 Summary of the geometric i 1 -distance 

Corollary [7] and Lemma [8] imply the existence of a line containing two of 
the points pj among all lines with minimal geometric L -distance to the set 
{pi, • • • ,Pm}- These are at most m(m — l)/2 lines. As in section 3.1 it is 
sufficient to check the lines which additionally satisfy 1 1 J+ 1 — | J- 1 1 < | Jq \ . 

As before, we denote the line through pj and pk with j > k by gj k . The 
normal vector of gj k is rijk := J(pj — Pk)\\Pj — Pfell -1 - The line gj k is given by the 
equation (q,rijk) = (pk>njk) = ; c jk- Let E be the set of points in the domain / 
corresponding to optimal lines. More precisely, 

E := { ± (c jk ,n jk ) : f(c jk ,n jk ) = min{/(c,n) :c£R,n£ S 1 }} . (10) 

Since / : M x S 1 — > K is only convex with respect to c, we perform the convex 
hull only in one direction and define 

E := {(c,n) £ KxS 1 : 3t £ [0, 1], (c , n), (c Xj n) G £ : c = <c + (1 - t)a}. (11) 

Theorem 5. Let pj £ R 2 , j = 1, . . . , m, be pairwise distinct points. 

The line defined by the equation (q, n) = c with n £ S 1 and c £ R /ias minimal 

geometric ^-distance to the set {pi, . . . ,p m } if and only if (c, n) £ E . 
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Proof. The statement is a consequence of Corollaries [7J and [8] 



□ 



Remark 4. As long as the set E can be defined Theorem [5] remains true if the 
condition pj ^ pk for all j ^ k is omitted. However, if p\ = . . . = p m = p 7 then 
all lines containing p have zero distance to the point set and are optimal. Note 
that only finitely many normal vectors occur in E for a generic point set, whereas 
optimal lines with any normal vector exist in the special case pi = . . . = p rn = p. 

3.2.4 Examples 

Three points: Three pairwise distinct points p\, P2, P3 form a triangle D. 
Let A be the area of D. If j > k and I ^ j, k, then f(gjk) — d(pz, <?jfc)- Since 
2A = \\pj — PkWfidjk) f° r all j 7^ k, the line gjk is an optimal line if and only if 
gjk contains the longest edge of the triangle D. If there are two longest edges 
(Z 2 -symmetry) or if D is an equilateral triangle (63-symmetry), then there 
exist two respectively three geometric i 1 -optimal lines. 



Four points without symmetry ad- 
mitting two optimal lines: Let us 

consider the four points p\ = (0,0) T , 
P2 = (2,0) T , P3 = (3,0) T , Pi = (2,3) T 
(see Figure 10 1. If the lines 343 or (741 
were optimal, then there would exist an 
optimal line containing exactly one of the 
points pj . This would contradict Lemma 
[HJ since translation into the direction of 
P2 does not change the value of /. 

The normal vectors of the lines 321 = 
331 = 932 and g i2 are n 2 i = (0, 1) T re- 
spectively n 42 = (1,0) T . Now /(0, n 2 i) = 
3 = /(2,n4 2 ). Since n 21 7^ ±ri4 2 there 
exist exactly two lines with minimal geo- 
metric L 1 -distance to the set {pi, . . . ,pa\. 



941 



P4 



942 



943 



921 -931-S 32 



Pi 



P2 



\ p 3 



Figure 10: 



Invariance under reflection for m — 5: Consider the points pi = (—2, 1) T , 
P2 = ( — 1, — 1) T , P3 = (0,0) T , pi — (1,— 1) T and p§ = (2, 1) T as in subsection 
|3.1.5| and Figure [6] The lines g 2 i, 542, ff54 and 551 do not fulfill the condition 
I Jo I > - \J-% since for those lines | J Q \ = 2 < 3 = ||J+| - |J_||. 

Now n 53 = (l,-2) T / v / 5, n 43 = (1,1) T /V2, n 52 = (2, -3) T /Vl3, c 53 = 
C43 = and C5 2 = 1/ y/l3. Using the reflection symmetry we obtain 

/(0, nai) = /(0, n 53 ) = (4 + 1 + 3)/y/5 = 8/s/E, 
/(0, n 32 ) = /(0, n 43 ) = (1 + 2 + 3)/>/2 = 6/^2 > 8v^, 
/(c 4 i, n 41 ) = f{c 52 , n 52 ) = (8 + 1 + 4)\/l3 = Vl3 > 8^5. 
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The lines 331 and 353 are not parallel. Hence, E = E. There are exactly two 
lines with minimal geometric i^-distance to the set {p\, . . . ,ps}. These are (731 
and g 53 . 



4 Maximal distance - L°°-norm 

In this section we minimize the largest of the distances of the single points pj . We 
consider the continuous function f(g) = max{dj : j = 1, . . . , to}. The function 
/ is differentiable if there is exactly one largest dj. We show the existence of 
the global minimum of / for the algebraic and the geometric distance. The set 
of optimal lines is finite in both cases, because optimal lines are located in a 
special manner between the edges and vertices of the convex polytope generated 
by pi, .. . ,p m - Using the vertical distance the function / becomes convex and 
the global minimum is then attained at a unique line. 



4.1 Minimal algebraic L°°-distance 

Given points pj = (xj,yj) T € M 2 , j = 1, . . . , to, we determine the linear func- 
tion y(x) = ax + b, a,b € M, with minimal algebraic L°°-distance to the set 
{pi, . . . ,p m }, i-e., the minimum of the continuous, piecewise linear and convex 
function / : R 2 — >• M defined by f(a, b) = max{\yj — (axj + b)\ : j = 1, . . . , m}. 
As in the sections 2.1 and |3.1| we additionally assume that Xj 7^ x^ for all 



j 7^ k and decompose the index set {pi, . . . ,p m } into J + = {j : yj > y(xj)}, 
Jo = {j ■ Vj = y(xj)} and J_ = {j : y i < y{xj)}. Now 

/(a, b) = max {0, max{j/j — axj — b : j € J+}, maxjaxj + b — yj : j G J-}} ■ 



4.1.1 Translation 

As in section [3.1.2| we restrict / to lines with arbitrary fixed slope nel. Again, 
set j3j :— yj — axj and let a be a permutation satisfying < • • • < ^o-(m) 

(see Figure [5| . 

Lemma 9. min{/(a, b) :6eR} = f{a,b ) if and only ifb Q = (/3 <T(m) +/3 CTCT(1) )/2. 
Proof. The assertion follows from 

f(a,b) = max {\f3j - b\} = nmx{\(3 a{m) - b\, \(3 a{1) - b\} > {0 a r m) - P a m)/2 

andmax{|/? CT(m )-6|, = (/3 . (TO )- ( S er (i))/2 <^ b = {j3 a(m) +l3 IJa{1) )/2. □ 

Corollary 10. 7/min{/(a, b) : a,b g M} = f(ao,bo), then there exist indices 
k 7^ I such that /(ao,6o) = A — °o = bo — fik and the point (pk +pi)/2 lies on 
the line defined by y = a^x + bo . 
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Proof. The equation /(a ,6o) = f3i — bo = bo — f3 k holds for k = cr(l) and 
I = a(m). It is easy to check that 

/ x k + xi\ x k + xi 1 + y/ 

2/ z = ao ^ h o = - (a x fe + a xi + Pi + Pk) = 



2 ) 2 2 V " ' 2 

for the function y(x) = a a; + b . □ 

4.1.2 Rotation 

For any pair of indices (fc, I) with fc 7^ ^ we consider the values of / at lines 
containing the point (pk + pi)/2. The condition y((xk + xi)/2) = (yk + yi)/2 
implies 26 = (yk + yi) — a(xk + xi). We investigate the function/ : K — > K 
defined by 

f(a) := f(a, (y k + yi ~ a(x k + xi))/2) 

= max{|y J - ax 3 - (y k + yi)/2 + a(x k + xi)/2\ : j = 1, . . . , to} 
= max{|y., - a,Xj\ : j = 1, . . . , to} 

where i 2 := - (x k + x t )/2 and ^ := y i - (y k + yi)/2. Note that x k , x x 7^ 0, 
since the coordinates Xj are pairwise distinct. 

Lemma 10. If f(a ) = min{/(a) : a € R}, f{a ) = y t - a i; = a x k - y k and 
/(ao) > 0, then there exists an index j 7^ fc, I such that /(ao) = \yj — aoXj\. 

Proof. Note that \yi — axi\ = \yk — axk\ holds for all afK, because yi — axi = 
(xi - x k )/2 - a(yi - j/ fe )/2 and y k - ax k = {x k - Xi)/2 - a(y k - yi)/2. If the 
inequality \yj — aoXj\ < \ijk — aoXk\ holds for all j 7^ k, I, then / is difierentiable 
at ao and /'(ao) = —xi = ifc 7^ 0. □ 

4.1.3 Existence and Uniqueness of the optimal line 

Corollary 11. The function f : M. 2 —> R defined by (a, b) 1— » m&x{\yj — axj — b\ : 
j = 1 . . . , to} admits a global minimum. 



Proof. For any k 7^ I the continuous functions investigated in subsection |4. 1.2 



admit a global minimum, since lirria^-i-oo \yj —axj \ = 00 for all j with x j 7^ 0. □ 



If < /(ao, 60) = mm {/( a i b) : a, 6 g R}, then Corollary 10 and Lemma 10 
imply the existence of three pairwise distinct indices k\ , ki , ^3 with the following 
properties: The line g kl k 2 through p kl and pk 2 is parallel to the line given by 
y = a Q x + b 0l f{a ,b a ) = \y k . ~a Q x kl - b \ for i = 1,2,3, and p k3 9k x k 2 - Thus, 
the value /(ao, 60) is half of the vertical distance between the point pk 3 and the 
line g kl k 2 , i.e., 



f{a ,b ) = - 



Vk 2 ~Vkx i \ 
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Lemma 11. If f(ao,bo) = f(ai,bi) — min{/(a, 6) : a,b € M.}, then ao = ai 
and bo = b\ . 

Proof. There are only finitely many triples (ki,k2,ks) with the properties de- 



scribed in Lemma 10 Therefore, the global minimum of the function /(a, b) 
is attained only at finitely many lines. Since / is a convex function, the set of 
optimal lines is a convex set. Finite convex sets contain at most one element. 

This lemma can be proven without making use of the general properties 
of convex functions directly. We provide this second proof to clarify the link 
between the uniqueness of the optimal line and the standard assumption Xj ^ Xk 
for all j 7^ k: The condition d = /(ao, 6o) = /(oi, b\) implies 

Pj G {(x,y) T : d>\y — aix — bi\,i = 0, 1} =: M for all j. 

If ciq = cti, then all points pj are contained in the intersection of two parallel 
stripes of vertical width 2d. This intersection is a parallel stripe of width < 2c? 
if and only if bo ^ b\, since 

M = {(x, y) : d + min(6o, &i) > y — a$x > —d + max(6i, bo)} 

for Oo =ax. 

If a\ 7^ ao, then M is a parallelogram. Using the triangle inequality we 
obtain M C {{x,y) : 2d > \2y — (ai + ao)x — {b\ + &o)|}- Since d is the global 
minimum of the function /, it follows that /( gq + ai , b °+ bl ) = d. A small calcu- 
lation, that again uses the triangle inequality, shows that 

M n {(x, y):2d= \2y - {a x + a )x - (h + b )\} = {(x , a x Q + b ± d)} 



for xq = —(bi — b )/(ai — a ). This fact contradicts Corollary 10 since the 



coordinates Xj are pairwise distinct. □ 
4.1.4 Summary of the algebraic L°°-distance 

The notions convex hull of the set {pi, . . . ,p m }, edge and vertex are appropri- 
ate to effectively characterize the triples of indices with the properties being 
specified in subsection |4.1.3| . 

The convex hull H({pi, . . . ,p m }) of the points Pi, ■ ■ ■ ,p m is denoted by P. 
A point pk is a vertex of the polytope P, if pk is not contained in the convex 
hull of the remaining points pj, j ^ k. The set of vertices is denoted by V, i.e., 
V := {pk ■ Pk & H({pj : j 7^ k})}. The boundary dP of the set P consists of 
line segments of the form Ski '■= {pi + t{Pk — Pi) ■ < t < 1} with k, I G V. 

Theorem 6. Let pj = (xj, yj) T ■ j — 1, . . . ,m, such that Xj ^ Xk for all j 7^ k. 
There exists a unique linear function with minimal algebraic L°° -distance to the 
set {pi, . . . ,Pm\. Moreover, 

min f{a, b) = min max \yj — axj — b\ = min max \yj — axj — b\ 

a,bGK a,b£R j=l,...,m a,b£Kpj£V 



1 

- mm max 

2 s kl edP Pj ev 



Vk -yt , , 
yj - yk {xj - x k ) 

x k - Xl 
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If < /(ao,&o) = min{/(a, 6) : a, b G M}, </ien i/iere exzsi pairwise distinct 
indices ki, tv2, &3 G smc/i i/iai /(ao, bo) = \y ki — ctox ki — &o| for i = 1,2,3 and 
S klk2 C 5P. 

Corollary 12. // £/ie sei {pi, . . . ,p m } is invariant under the reflection in the 
line defined by x = x, then the linear function with minimal algebraic L°°- 
distance is 

y(x) = - (max}?/,- : j = 1, . . . , to} + min{y,- : j = 1, . . . , to}) . 

Proof. Similar to the proof of Lemma [l] this assertion follows from the unique- 
ness of the optimal line and Lemma [9] □ 

Remark 5. Note that the optimal lines with minimal algebraic L°°-distance 
depend only on the convex hull P of the points pi, ■ ■ ■ ,p m - In particular, the 
minimum of / is equal to zero if and only if P is a line segment, i.e., there 
are at most two vertices. As long as only edges not parallel to the y-axis are 
considered, Theorem [6] remains true, if the condition Xj ^ x k for all j ^ k is 
weakened to the existence of indices j ^ k satisfying x.j ^ x k . 

However, if x\ = . . . = x m = x, then P is a line segment parallel to the 
y-axis. The linear functionen y{x) — a(x — x) + b has minimal algebraic L°°- 
distance to the set {p±, . . . ,p m } if and only if a G M and 

b = \ (min{yi, • ■ • , y m } + max{yi, . . . , y m }) . 
In contrast to Theorem [6j these are infinitely many optimal lines. 



4.1.5 Examples 

Three points: As in the first example in subsection |3.1.5| we consider three 
points Pj = (xj,yj) T G R 2 with X\ < x 2 < £3. If Pi,P2,P3 ar e not collinear, 
then E = {p\,P2,P , i\- Applying the results of subsection 3.1.5 we conclude that 
the line parallel to S31 with equal vertical distance to S31 and p 2 has minimal 
algebraic L°°-distance to {pi,P2,P3}, because P2 is the given point with smallest 
vertical distance to the corresponding opposite side of the triangle. 



Four points: We consider the examples concerning four given points of the 
subsections 3. 1 . 5| (see Figure [4]) and 3.2.4 (see Figure 10 1. The linear functions 
with minimal algebraic L°°-distance can be quickly determined for these given 
point sets, because three of the four given points are collinear in both cases. 
Hence, the set of vertices consists of three elements. As in the paragraph above, 
the optimal line is parallel to the line through the two vertices with smallest 
respectively largest x-coordinate (see Figures 11 and 12). 



Invariance under reflection symmetry: The set of points Pi, ...,ps in 
Figure [6] ist invariant under reflection in the y-axis. Corollary [12] leads to the 
optimal linear function y(x) = 0, since p — (0,0) T . 
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Figure 11: 



Figure 12: 



4.2 Minimal geometric L°°-distance 

Given pairwise distinct points pj € M 2 , j = 1, . . . , to, we determine lines given 
by j = {g e M 2 : c = (g, n)}, c € t, n G S 1 with minimal geometric L°°-distance 
to the set {pi, . . . ,p m }, i.e., the minimum of the function / : RxS 1 -> R defined 
by /(c,n) = max{|c- (p^,n)| : j = 1, . . . ,to}. 

As in subsection 3.2 we decompose the index set {1, . . . , to}. For every line 
{? £ I 2 : (q,n) = c} set J + := {j : (ft',n) > c}, J := {j : (pj,n) = c} and 
J_ := {j : (Pj,n) < c} (see Figure [8]). Now 

/(c, n) = max{0, max{(pj, n) - c : j e ^+}> max{c — (pj, u) : j € </-}}• 



Similar to the investigation of the algebraic L°°-distance in section [4~T] we show 
that the function / admits a global minimum and describe optimal lines by 
means of edges and vertices of the convex hull of pi , . . . , p m . Transferring the 
statements of section 4.1 note that /(c, n) is convex only in the variable c. 

4.2.1 Translation 

Set 7j := (pj,n). Let a be a permutation satisfying 7 CT (i) < ... < 7<r(m) as hi 
subsection 3.2.1 (see Figure [8J. 

Lemma 12. 7i ZioWs min{/(c, n) : c G M} = f(ca,n) if and only if cq = 
(7<r(m) +7<r t7(1) )/2. 

Proof. Replace 6 by c, /3 by 7 and a by n in the proof of Lemma [9] □ 

Corollary 13. If min{/(c, n) : c € M, n 6 5 1 } = f(cQ,no), then there exist 
indices k ^ I satisfying /(c , ri ) = 7 - c o = c - 7fc a™d ((p/fc + Pi)/ 2 ; "o> = Cq. 

Proof. Replace (3 by 7 and 60 by Co in the proof of Lemma |10| □ 

Corollary 14. The function f : K x S* 1 — >• K defined by the assignment (c, n) 1— > 
max{|c — (pj,n.)| : j = 1, . . . , to} admits a global minimum. 
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Figure 13: 



Figure 14: 



Proof. There are only finitely many points of the form (p k +pi)/2 with k > I, 
S 1 is a compact set and the function / is continuous. □ 

4.2.2 Rotation 

For any pair of indices (fc, I) with k =/= I we consider the values of / at lines 
containing the point (p k +pi) /2. The condition (p k +pi)/2 G g = {q : (q, n) = c} 
implies ((pk +pi)/2,n) — c. Therefore, we investigate the function / : S 1 — > K 
dchned by 

f(n) :=f(({p k +Pl)/2,n),n) 

= max{|((p fc +pi)/2,n) - {pj,ri)\ : j = 1, . . . , m} 

= max{\(pj - (p k +pi)/2,n)\ ; j = 1, . . . , m} 

= max{\(pj,n)\ : j = 1, . . . ,m} where pj := pj - (p k +pi)/2. 

Note that p k = -pi ^ 0, since p k = (p k - pi)/2, pi = (p t - p k )/2 and p k ^Pi- 

Lemma 13. If f(n ) = min{/(n) : n G S 1 }, f{n ) = (pi,n ) = -(p k ,n Q ) and 
/(no) > 0, then there exists an index j ^ k,l such that /(«o) = \(Pji n o)\- 

Proof. Note that (pi,n) = —(p k ,n) for all n € S 1 , since p k = —pi. If the 
inequality | (pj,n(t Q )) \ < f(n(to)) holds for all j ^ k, I, then the function h(t) :— 
f(n(t)) given by the parametrization n(t) = (cost,smt) T is differentiable at t . 
Note that / € J + . Now, h'(to) = pfjn(t ) — if and only if there exists A > 
such that pi — Xn. If h'(to) — 0, then h"(to) — pf JJn(to) = —A < 0, since 
n £ S 1 and J 2 = — Id. Consequently, if \(pj,n(t)) \ < h{t) for all j ^ k,l, then 
h(t) is not a local minimum, (see Figure |l3"|) □ 



4.2.3 Summary of the geometric L°°-distance 



Using the notation of subsection 4.1.4 for the convex hull and its edges and 



vertices we characterize lines with minimal geometric L°°-distance. 
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Theorem 7. Let pj 6 K. , j = 1, . . . , m, be pairwise distinct points. 
There exists a line with minimal geometric L°° -distance to the set {pi, . . . ,p m }- 
If < /(co,no) = min{/(c, n) : c 6 K,n € S 1 }, then there exists pairwise 
distinct indices ^1,^2,^3 € V such that f(cQ,n Q ) = \{Pki> n o) ~ c o\ fori = 1,2,3 
and Sj. 1 k 2 C dP. Moreover, 

min f(c. n) — min max , n) — c\ = min max |(p,-, n) — c| 
ceR.nGS 1 v ; cei,nesij=i,.,ro lw ' ' 1 cm Pj ev ,x yj ' ' 1 

1 . \{Pj ~Pk,J(pi -Pk))\ 

— - mm max — n n . 

2s kl edP Pj ev \\Pi - Pk\\ 

In particular, the set of optimal lines is finite. 

Proof. If pi , . . . , p m are not collinear, then Lemma [13] and Corollary [13] imply 
the following properties of a line g with minimal geometric i°°-distance to the 
set {p\, . . . ,p m } : The line g is parallel to an edge Sk 1 k 2 of the polytope P. 
The geometric distance between any vertex in V and g is less or equal to the 
geometric distance between Sk 1 k 2 an d 9- There exists a vertex p k3 e E such 
that pfe 3 ^ 5fe 1 fe 2 and the geometric distance between g and p/c 3 is equal to that 
between g and Sk 1 k 2 - Thus, the minimum of / is half of the geometric distance 
between S kl k 2 and p ks . 

Since n = J(pi — p k )\\Pi ~ Pfcll 1 is a normal vector of the line through 
the points p k ^ pi, the geometric distance between pj and Ski is given by 
\(Pj -Pk,J{Pi -Pk))\\\Pl -PfclT 1 - □ 

Remark 6. The geometric, just as the algebraic, L°°-distance to a point set 
{pi, . . . ,p m } depends only on the convex hull of the points pi,.. ■ ,p m - Hence, 
Theorem [7] remains true if the condition pj ^ p k for all j 7^ k is weakened to 
the existence two indices j 7^ k such that pj 7^ p k . If p\ = . . . = p rn = p, then 
P = V = {p} and any line through p has minimal distance zero. 

4.2.4 Examples 

Three points: We consider three pairwise distinct points Pi,P2,P3 G as 



in the first example of subsection 3.2.4 If pi,p2,P3 are not collinear, then 



V = {pi,P2,P3}- A line g has minimal geometric L°°-distance to {pi,P2,P3} if 
and only if g is parallel to the longest side s of the triangle and the geometric 
distance between s and g is equal to the geometric distance between g and the 
point opposite to s. 

Four points: Let us consider the four points pi = (0, 0) T , P2 — (2,0) T , 
p 3 = (3,0) T and p± = (2, 3) T (see PigureQ. Obviously, V = {pi,p 3 ,p 4 }- The 



edge S*4i is the unique longest side of the triangle piP3Pi- There exists a unique 
line with minimal geometric _L°°-distance to {pi, . . . ,^4}. This optimal line is 
parallel to S41 and has the same geometric distance to 541 and to p 3 . Compare 



this line to the lines with minimal algebraic _L°°-distance (Figure 12 ) and with 



minimal geometric L 1 -distance (Figure 10), they are all different. 
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Figure 15: 



Figure 16: 



Invariance under reflection symmetry: Let us consider the five points 
Pl = (-2, 1) T , p 2 = (-1, -1) T , p 3 = (0, 0) T , P4 = (1, -1) T and p 5 = (2, 1) T as 
in the subsections |3.1.5||3.2.4| and |4.1.5| and Figure [6j The convex hull of the five 
points is a trapezium with V = {pi,P2,Pa,P5} (see Figure [l5| . We denote the 
geometric distance between the vertex pj and the edge Ski by d(Ski,Pj)- The 
edges S51 and S 42 are parallel. It is easy to see that d(S 42 ,P5) — d(S 4 2,pi) — 
d(S$i,p 2 ) = d(S^i,p 4 ) = 2. The set {pi, ■ ■ ■ ,Pb} is invariant under reflection 
in the t/-axis. Hence, d(S 2 i 7 p 5 ) — d(S 54 ,pi) — 2d(S2i,P4,) — 2d(S , 54,p 2 )- Using 
the formula given in Theorem [7] we calculate 

Mq v I(P5-P2,J(P1-P2))| |(3,2)J(-1,2) T | 8 

d{S 2 i,P5) = n n = 7= = — = > 2. 

IIP1-P2II V5 V5 

Thus, the unique line with minimal geometric i°°-distance to the point set 
{pi, . . . ,^5} is given by the equation y = 0. 

In this example the line with minimal geometric i°°-distance is unique and 
coincides with the line with minimal algebraic L°°-distance. In contrast to 
Corollary |12| the invariance under reflection in the line x = x does not imply 
the existence of an optimal line of the form y = y for the geometric L°°-distance, 
since the optimal line is not unique. Scaling the y-coordinate makes this different 
behavior more obvious. 

We consider the linear map A given by (x, y) 1— > (x, Xy) with 0<A£l and 
pj := Afa). Nowpi = (-2,A) T , p> 2 = (-1,-A) T , p' 3 = (0,0) T , p' 4 = (1,-A) T 
and p' 5 — (2, A) T . The set {p[, . . . ,p' 5 } is invariant under reflection in the y- 
axis for all A G M. Hence, the linear function y(x) = has minimal algebraic 
L°°-distance to the set {p' 1} . . . ,p' 5 }, since p' — (0,0) for all AeR. Similar to 
the calculations above we obtain V = {PijP^iPiiP^}, d{S42,p[) = d(S42,p' 5 ) = 
d(S 51 ,p' 2 ) = d(S 51 ,p' 4 ) = 2A, d(S 2 i,p' 5 ) = d(S 54 ,p[) = 2d(S 2 i,p' 4 ) = 2d(S 54 ,p 2 ) 
and 

d(S 2l , P ' 5 ) = ^' 5 ~ P ^ J(p ' 1 ~ P ' 2))l K 3 ' 2A ) J ( _1 ' 2A ) T I - 8A 



\\P[-P 2 \\ VI + 4A 2 VI + 4A 2 
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Figure [l6| shows the situation for the case A = 2. For 



d{S 42 ,p'i) < d(S 21lP ' 5 ) & + 4A 2 < 4 & |A| < 

the set of lines with minimal geometric L°°-distance to the set {p[, . . . ,p' 5 } 
depends on A > as follows: For A > \/l5/2 there are exactly two optimal 
lines, these are y = A ± 2\x. For < A < \/l5/2 the line y — is the unique 
optimal line. For A = \/l5/2 there are exactly three optimal lines, these are 
y = and y = A ± 2Xx. 



5 L p -Norm 

In this section we want to minimize the L p -norm of the vector (d%, . . . , d m ). We 
consider the function f(g) = X)j=i d% f° r P > 1- This function is continuously 
differentiable, since the function K — > K defined by x i-> |x| p has this property 
for p > 1. The case p = 2 has been discussed in section [2] 

We show that the function / admits a global minimum for every p > 1. For 
p =/= 2 the critical points of / are the solutions of a system of nonlinear equations, 
which, in general, can not be explicitly resolved by a closed formula. 

The function f(g) is convex in some of the parameters of the line g. We 
apply the following lemmas to prove the existence of a global minimum and 
properties of the set of optimal lines in the succeeding subsections. Except for a 
few special examples, e.g., m — 3 or symmetries of the point set, it is impossible 
to provide explicit formulas of L p -optimal lines. Therefore, the global minimum 
of / can be determined in most of the situations only numerically. 

Lemma 14. Let £j £ R n , rjj £ R and p > 1. 

V^jlitj- = ^%i{x £ R n : £jx = 0} = {0}, then the function f : R" -> R 
defined by f(x) — X^JLi \£f x + Vj\ F admits a global minimum. 

Proof. The function / is continuous. Set x := (x, 1) and £j := Now 
/(z) = Efci \lji\ p and 



i'=l 



for x 7^ 0. 



The function x i-> SJLi l£j^l p i s continuous and admits a global minimum 
m > on the compact set {||x| = 1} C 

If m = 0, then there exist xq — (xq, A) with xq £ R n and A £ R such that 
= £jxo = t,J x o + J /jA for all j and ||xq|| 2 + A 2 = 1. The assumption A = 
would imply xq ^ and £,J%o = f° r au j contradicting n^Lj^ — {0}. Hence, 
A 7^ and = /(xo/A) is the global minimum of the function /. 

If 77i > 0, then f(x) > (||x|| 2 + l)P/ 2 m > /(0) > for all x satisfying 
||x|| 2 > (/(0)/m) 2 / p — 1 =: R. Continuity of / and compactness of the set 
{||x|| 2 < R} C R n yield the existence of a global minimum. □ 
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Lemma 15. Let f : R n ->• R be defined by f(x) = Y^=i \£jx + Vj\ p with 
T}j S R, f> > 1 and ^ e R™ satisfying n? =1 £f = n™ =1 {v e R" : £jv = 0} = {0}. 
If Xq,X\ G R™ smc/i £/iai .f(x ) = /(a:i) = min{/(x) : x e ]R™} 7 iften x = X\. 

Proof. For / is convex, f(x t ) = ,f(x ) = f{x{) for all x t = tx\ + (1 — t)x with 
< f < 1. Wc define I := {j : £jx + rfj = £jxi + r\j = 0} C {1, . . . , to} and 
K := P\j e i{x : £jx + rjj = 0} C R". For K is the set of solutions of a system 
of linear equations, K is an affine subspace of R". Thus, K is a convex set. In 
particular, x t e K for all < t < 1. 

The restriction of / to K is given by f\x{x) — \^J X + Vj\ p - The 

functions / and f\j( are twice continuously differentiable for p > 2. But for 
p < 2 the restriction /|k is twice continuously differentiable only on the set 
open subset U := (ljgi{x G K : £jx + rjj ^ 0} C K. 

If x ta ^ x tl and t,J Q x to = ££ar tl = for j , ji ^ then j 7^ ji- Since there 
are only finitely many indices, the intersection U n {x t : 1 < t < 1} is an open 
nonempty subset of {xt : 1 < t < 1}. 

Given x £ K and u e R™, the sum x + v is an element of K if and only if 
t;Jv = for all j e I. The set V := rij e j{i« e R™ : £jw = 0} is a vector space. 
For any x £ U and / » 6 F we consider the function h{t) :— f(x + tv). It 
holds 

h"(o) = p( P + % r 2 (#») 2 > o. 

This means that U n {x t : 1 < t < 1} 7^ contains exactly one point. Thus, 

.To = X\. □ 



5.1 Minimal algebraic //-distance 

Given points Pj — (xj,yj) T e E 2 , j = 1, . . . , m, we want to determine the linear 
function y(x) — ax + b, a, b e R, with minimal algebraic L p -distance to the set 
{pi, . . . ,p m }, i.e., the minimum of the function 

m 

f:R 2 ^ R, /(a, 6) = £ |y, - (ax, + 6)|f. 
i=i 

As for the investigation of the algebraic L 1 -, L 2 - and L°°-distance we addition- 
ally assume that xi, . . . , x m are pairwise distinct. 

The function / is continuous, continuously differentiable and convex. Defin- 
ing the decomposition J + := {j : yj > y(xj)}, Jq := {j : j/j = and 
J_ := {j : t/j < 2/(xj)} for every linear function y(x) = ax + b wc obtain the 
partial derivations 

f b (a, b) = P\Vj - axj - o| p_1 - ^ P\Vo - ax j ~ h \ P ^ 
jeJ- jeJ+ 

f a (a, b) = Y P x i\yj - ax j - b \ P ^ 1 - P x o\Vj ~ ax i ~ b \ P ^ 1 - 
jeJ- jeJ+ 
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Thus, /(a, b) is a global minimum if and only if 



and 



je.T- 



E x ^\yj 



b\ p - 



E 



If p = 2n with n G N and n > 1, then f(a,b) = YlljLiiVj — 



/a (a, b) =pJ2j=i x i(yj 



6) 2n - 1 and/ 6 (a,b)=pE7=i(% 



(12) 



(13) 



\2n-l 



In this case the equations ( 12 ) and ( 13 ) are polynomials in the variables a and 
b of degree p — 1. Even if it possible to eliminate one of the variables, there is 
no general closed formula that expresses the common solutions of the equations 
(12) and (13 1 for n > 1 in terms of the coordinates Xj, yj. 



Theorem 8. Let pj = (xj,yj) T , j = 1, . . . , m such that Xj ^ Xk for all j ^ k. 

For any p > 1 there exists a unique linear function with minimal algebraic 
LP -distance to the set {pi, . . . ,p m }. A linear funktion y{x) — ax+b with s.kR 
has minimal algebraic LP -distance to the set {pi, . . . ,p m } if o,nd only if a and b 



satisfy the equations (12) and (13) 



Proof. The assertion follows from Lemma 14 and Lemma 15 with r/j — yj and 



( -"ji' - 

satisfied, since j > 1 and Xj ^ Xk for all k ^ j. 



T) T . The condition n"L 1 {(a,&J r G 



£f(a,b) T = 0} = {0} is 

□ 



Corollary 15. If the set {pi, . . . ,p m } is invariant under the reflection in the 
line given by x = x, then the unique line with minimal algebraic L°° -distance to 
{pi, . . . ,p m } is the graph of the linear function 

y{x) = b where £ ( Vj ~ b^ 1 = £ (fc, - y^ 1 . 

Vj>bo Vj<bo 

Proof. Since there is a unique optimal linear function, the proof is similar to 
that of Corollary [I] The condition on bo follows from equation (12). □ 



Remark 7. Theorem [8] remains true if the condition Xj ^ Xk for all j ^ k is 
weakened to the existence of indices j ^ k such that Xj ^ Xk, since even in that 
cases n™ i{(«, b) T G R 2 : £j(a, b) T = 0} = {0}. 

However, if x± = . . . = x m = 5, then the values of f(a, b — ax) are indepen- 
dent from a and there exists a unique 6o6lR such that /(0, bo) < /(0, b) for all 
b G K. Thus, there are infinitely many linear functions with minimal algebraic 
L p -distance to {pi, . . . ,p m }- These are exactly y(x) — a(x — x) +bo with a G R. 



5.2 Minimal geometric //-distance 

Given pairwise distinct points pj G R 2 , j = 1, . . . , to, we want to determine the 
lines given by g = {q G R 2 : (q, n) = c}, c G K, n G $ , with minimal geometric 
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L p -distance to the set {pi 7 . . . ,p m }, i-e., the minimum of the function 

m 

/:KxS^K, f(c,n) = £ \c - ( Pj ,n)\P. 

3 = 1 

As for the investigation of the geometric L 1 -, L 2 - and L°°-distance we decompose 
the index set, this means J + := {j : (pj, n) > c}, J := {j : (pj, n) = c}, J_ := 
{j : (pj 7 n) < c} (see Figure [8]). Using the parametrization n(i) = (cost, sin t) T 
we obtain the following partial derivations of the function /(c, n(t)): 

f c (c,n)= P J2 Ic-fri^r 1 -^ Ic-fo.n)!*- 1 
ieJ- iGJ+ 

/ t (c,n)=p2j l c_ (P3:«)I p_1 pJ^-P 2J l c_ (Pj' n )l P_1 pJ Jn 

If /(<?) is a local minimum and the line g is given by (q, n) = c with c G R and 
n G 5 1 , then 

X; ic-^r^ ic-^r 1 (u) 

and 

X; |c- ^,n)rVJJn = ]T |c- ^,n)rVJJn. (15) 
jeJ- je.r+ 



The equations ( 14 ) and (151 are polynomials in c and n respectively real analytic 
expressions in c and t if p = 2n with n G N. Except for p = 2, there is no general 
explicit formula of the solutions of the equation df = (0,0). 

Theorem 9. Let pj G M. 2 , j = 1, . . . , m be pairwise distinct points. 

For any p > 1 there exists a line with minimal geometric LP -distance to the 
set {pi, . . . ,Pm\ ■ If the line g = {q G R 2 : (q, n) = c} with eel and n G S 11 
Aas minimal geometric LP -distance to the set {p\, . . . ,p m }, then c and n satisfy 
the equations |I^] ) and 

If /( c 0j n ') — /( c i: n ') = m i n {/(c, n) : c G R, n G S 1 }, then cq = c%. 

Proof. Applying Lemma |14| and Lemma 15 with rjj = —(pj,n) and £j = (1) it 
follows that for any n G S 1 there exists a unique c„ G K such that f(c n ,n) — 
min{/(c, n) : c G RJ. Since the function c i— > f(c,n) is continuously differen- 
tiable, equation ( 14 1 yields J2jeJ- l c " ~ (Pj, "->l p_1 = X^eJ+ l c « ~ (Pj^ n )\ P ~ 1 - 
In particular, J + , J_ ^ {1, . . . , m} for any line g = {q : c n = (q, n)}. Thus, 

a := min (Pj,n) < c„ < max (Pji n ) ='■ b 

j—l,...,m,n^S 1 j — l....,m, n^S 1 

for all n G S 1 . The continuous function / : R x S* 1 — > R, (c, n) i-> f(c,n), 
admits a global minimum on the compact set [a, 6] x S 1 . □ 

Remark 8. Theorem [9] remains true if the condition Xj ^ a;*, for all j ^ k is 
omitted. 
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Remark 9. If the set 

M := {(c , n ) : /(c , n ) < f(c, n)VceR,n€ S 1 } ClxS 1 

is infinite, then M has an accumulation point, because M C [a, 6] x S 1 . If 
p = 2n with ii e N, then the existence of such a limit point in M implies 
that for any n £ S 1 there exists c„ £ K such that (c n ,n) £ M, since / is a 
polynomial in c and n. If p ^ 2N, then, in general, the function /(c, n) is not 
real analytic. We denote the set of normal vectors of optimal lines by N, i.e., 
N := {n £ S 1 : R x {n} n M 0}. If p £ 2N, then TV is finite or N = S 1 . Is 
this fact true for any p > 1? 

The lines with minimal geometric L p -distance to the vertices of an equilateral 
triangle are identified in [7j by taking advantage of the symmetries of the point 
set for all 1 < p < oo. In this simplest nontrivial situation the set M consists of 
three optimal lines for p > 1 with p ^ 2, 4/3 and N = S 1 for p = 4/3 and p = 2. 



A Convexity 

A subset K C M n is called convex if the line segment {tx± + (l — t)xo : < t < 1} 
is contained in K for any x$,x\ £ K. The convex hull H(K) of a set if C M" 
is given by H(K) := {x = tx% + (1 — t)a;o : cco,a;i S if, < t < 1}. A subset 
K C K ra is convex if and only if K — H(K). Convex sets are connected. 

Let K C K n be a convex set. A function / : K — > K. is called convex if the 
inequality 

/(tax + (1 - t)x ) < tf{x x ) + (1 - t)f(x ) (16) 
holds for all Xq, xi £ K and all < t < 1. The function / is called strictly convex 



if the inequality (16) is strict for all t £ (0, 1) and x ^ Xi. If K ^ C K 2 C R n 
are two convex sets and the function / : i^2 — > K is convex, then the restriction 
/| if j : K\ — > K is convex. 

Lemma 16. Lei / : M n — > R be a convex function. The set 

M := {x £ W : f(x) < f(y) for all y £ R n } 

is convex. If f is strictly convex, then M contains at most one element. 

Proof. The following inequality holds for all < t < 1 and xo,xi £ R n : 

f{t Xl + (1 - t)x ) < t/(xi) + (1 - t)f(x ) < max{/(x ), f( Xl )} 

If Xq,Xi £ M, then f(txi + (1 — t)xo) = f(x ) = f(xi), i.e., txi + (l — t)x Q £ M. 
If / is strictly convex, xo ^ x\ £ R™ and < t < 1, then 

f(t Xl + (1 - t)a:o) < t/(a:i) + (1 - t)/(«o) < max{/(x ), f(xx)}. 
Consequently, {xo, x{\ (£ M. □ 
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Lemma 17. For any £ € R™ £/ie linear function y : R™ — > R defined by 
y(x) = £ T x is convex. 

Proof. It holds + (1 - t)x ) = £ T (tai + (1 - t)x ) = t£ T xi + (1 - t)£ T iro = 

*y(a;i) + (1 - t)y(x ). □ 

Lemma 18. If f,g : R™ D X — > R are convex functions on a convex set K, 
then f + g and max{/, g} are convex functions on K. 

If f or g are additionally strictly convex, then f + g is strictly convex. If f 
and g are strictly convex, then max{/, g} is strictly convex. 



Proof. Applying the inequality ( 16 ) to / and g we obtain 

(/ + g)(txi + (l - t)x ) = f(txi + (l - t)x ) + g{tx x + (l - t)x ) 

< tf{xx) + (1 - t)f(xo) + tg( Xl ) + (1 - t)g(x ) 
= t(f + g)(x 1 ) + (l-t)(f + g)(x ). 

Since t > and 1 — t > 0, it follows that 

max{/,g}(tei + (1 - t)x Q ) = max{/(txi + (1 - t)x ),g(tx 1 + (1 - i)af )} 

< m&x{tf( Xl ) + (1 - t)f(x ),tg{xi) + (1 - t)ff(*o)} 
<tmax{/(a;i),5(xi)} + (1 - t) max{/(i ), 5(^0)} 
=tmax{/,g}(xi) + (1 - t) max{/,3}(x ). 

□ 

Corollary 16. The function R — > R defined by x 1— > |x| is convex. 

Proof. \x\ — max{i, — x} □ 

Lemma 19. Lei J\T/ C R and lf g C R™ oe convex sets and f : Kf — > R and 
<7 : -ftTg — >• R &e convex functions satisfying g{K g ) C -ftTj. // / is an increasing 
function on Kf, then fog: K g —¥ R is a convex function. 

Proof. The inequality g(txi + (1 — t)x ) < tg(xx) + (1 — t)g(xo) holds for all 
Xo, a;i G iiTg and for all < t < 1. The monotony and the convexity of / imply 

(/ o g){tx x + (1 - t)a:o) = + (1 - t)x )) < f{tg{x{) + (1 - i)<?(z )) 

< */GK*i)) + (1 - t)f(g(x )) 

= t(fog)(x 1 ) + (l-t)(fog)(x ). 

□ 

Lemma 20. Let K C R 6e a convex set. If f : K — > R is a convex twice 
continuously differentiable function, then f"{x) > /or aZ/ ieR. 
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Proof. For any a G K we consider the function h(x) := f(x) — (x—a)f'(a) — f(a). 
Now ft(a) = 0, ft'(a) = 0, h"(a) = f"(a). The function / is convex if and only 
if h is convex. The convexity of ft implies 

h(a) = h ^ — ^ H ~ 2p l ^ a ~ e ) ^ ^ a ^ — max ((M° — e),h(a + e)) 

for all e. Hence, ft(a) is not a strict local maximum and ft" (a) > 0. □ 

Lemma 21. Let if C R &e a convex set and f : K — > M &e a convex function. 
U f"i x ) > / or a ^ x £ K, then f is convex. If f"(x) > /or a/Z x £ K, then 
f is strictly convex. 

Proof. For any xo,xi € if and < £ < 1 we set x t := tx\ + (1 — t)xo and 
consider the function h(x) := /(x) — (x — x t )f'(x t ) — f(x t ). Now ft(xi) = 0, 
h'{x t ) = and ft" (a;) = f"{x). 

If h"(x) > for all x £ K, then ft' (a;) is increasing. There exist a, b E K such 
that a < x t < b, ft' (a:) = for all a < x < b, h'(x) < for all x < a and ft' (a;) > 
for all x > b. Hence, h(x) > for all x £ K . Now tft(xi) + (1 - t)h(x ) > 
implies 

< t(f( Xl ) - ( Xl - x t )f'(x t ) - f(x t )) 

+ (1 - t)(f(x ) - (x - x t )f'(x t ) - f(x t )) = tf( Xl ) + (1 - t)f(x ) - f(x t ) 

If ft" (a;) > for all x £ K , then the inequality above is strict for < t < 1. □ 

Corollary 17. For any p > 1 ifte function f : [0, oo) — > R gwen by x ^ x p is 
strictly convex. 

Proof. It holds /"(x) = p(p - 1).tP- 2 > for all x > 0. If x = 0, x x > and 
< t < 1, then 

/(te +(l-t)*i) = f((l-t) Xl ) = (l-ty X \ < (l-t)f( Xl ) = tf(x ) + (l-t)f(x 1 ), 
since < 1 - t < 1 and (1 - t) p < (1 - t). □ 

Corollary 18. Let K C K" be a convex set. If f : K — > R is convex and twice 
continuously differentiable, then Hf(x) > for all x e K . 

Proof. It holds v T H f (x)v = ft"(0) > for all x £ K and for all v £ R™, since 
h(t) := /(a; + to) is a convex function. □ 

Corollary 19. Let K C R™ &e a convex set and f : K — »■ E 6e a differentiable 
function. If Hf(x) > /or a// x e if 7 iften / is convex. If Hf(x) > /or aZZ 
x £ K, then f is strictly convex. 

Corollary 20. For any £ € K™ and ant/ p > 1 the function R n — ► R defined by 
x i ^ |£ T x| p * s convex. 
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